Perpetual Learning for Non-Cooperative Multiple Agents

نویسنده

Luke Dickens

چکیده

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic games. These sequences of combined agent strategies (joint-policies) can be thought of as a walk through the space of all possible joint-policies. We argue that this walk, while containing random elements, is also driven by each agent’s drive to improve their current situation at each point, and posit a learning pressure field across policy space to represent this drive. Different learning choices may skew this learning pressure, and affect the simultaneous joint learning

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Agent Reinforcement Learning for Planning and Scheduling Multiple Goals

Recently, reinforcement learning has been proposed as an effective method for knowledge acquisition of the multiagent systems. However, most researches on multiagent system applying a reinforcement learning algorithm focus on the method to reduce complexity due to the existence of multiple agents[4] and goals[8]. Though these pre-defined structures succeeded in putting down the undesirable effe...

متن کامل

Inconsistency-Induced Learning for Perpetual Learners

One of the long-term research goals in machine learning is how to build never-ending learners. The state-of-the-practice in the field of machine learning thus far is still dominated by the one-time learner paradigm: some learning algorithm is utilized on data sets to produce certain model or target function, and then the learner is put away and the model or function is put to work. Such a learn...

متن کامل

Cooperative Control of Multiple Quadrotors for Transporting a Common Payload

This paper investigates the problem of controlling a team of Quadrotors that cooperatively transport a common payload. The main contribution of this study is to propose a cooperative control algorithm based on a decentralized algorithm. This strategy is comprised of two main steps: the first one is calculating the basic control vectors for each Quadrotor using Moore–Penrose theory aiming at coo...

متن کامل

University of London Imperial College of Science , Technology and Medicine Department of Computing Learning to Act Stochastically

This thesis examines reinforcement learning for stochastic control processes with single and multiple agents, where either the learning outcomes are stochastic policies or learning is perpetual and within the domain of stochastic policies. In this context, a policy is a strategy for processing environmental outputs (called observations) and subsequently generating a response or input-signal to ...

متن کامل

Multi-goal Q-learning of cooperative teams

This paper studies a multi-goal Q-learning algorithm of cooperative teams. Member of the cooperative teams is simulated by an agent. In the virtual cooperative team, agents adapt its knowledge according to cooperative principles. The multi-goal Q-learning algorithm is approached to the multiple learning goals. In the virtual team, agents learn what knowledge to adopt and how much to learn (choo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Perpetual Learning for Non-Cooperative Multiple Agents

نویسنده

چکیده

منابع مشابه

Multi-Agent Reinforcement Learning for Planning and Scheduling Multiple Goals

Inconsistency-Induced Learning for Perpetual Learners

Cooperative Control of Multiple Quadrotors for Transporting a Common Payload

University of London Imperial College of Science , Technology and Medicine Department of Computing Learning to Act Stochastically

Multi-goal Q-learning of cooperative teams

عنوان ژورنال:

اشتراک گذاری